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Abstract 

This work develops the mathematical methods underlying a recent 
quantum feedback experiment stabilizing photon-number states. It con- 
siders a controlled system whose quantum state, a finite dimensional den- 
sity operator, is governed by a discrete-time nonlinear Markov process. In 
open-loop, the measurements are assumed to be quantum non-demolition 
(QND) measurements. This Markov process admits thus a set of sta- 
tionary pure states associated to an orthonormal basis of the underlying 
Hilbert space. These stationary states provide martingales that are cru- 
cial to characterizing the open-loop stability: under simple and suggestive 
assumptions, almost all trajectories converge to one of these stationary 
states; the probability to converge to a stationary state is given by its 
overlap with the initial quantum state. From these open-loop martin- 
gales, we construct a supermartingale whose parameters are given by in- 
verting a Metzler matrix characterizing the impact of the control input on 
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the Kraus operators defining the Markov process. This supermartingale 
measures the "distance" between the current quantum state and the goal 
state chosen from one of the open-loop stationary pure states. At each 
step, the control input minimizes the conditional expectation of this su- 
permartingale. It is proven that the resulting feedback scheme stabilizes 
almost surely towards the goal state whatever the initial quantum state. 
This state feedback takes into account a known constant delay of arbitrary 
length in the control loop. This control strategy is proved to remain also 
convergent when the state is replaced by its estimate based on a quantum 
filter. It relies on measurements that can be corrupted by random errors 
with conditional probabilities described by a known left stochastic matrix. 
Closed-loop simulations corroborated by experimental data illustrate the 
interest of such nonlinear feedback scheme for the photon box, a cavity 
quantum electro-dynamics system. 

1 Introduction 

Manipulating quantum systems allows one to accomplish tasks far beyond the 
reach of classical devices. Quantum information is paradigmatic in this sense: 
quantum computers will substantially outperform classical machines for several 
problems |13j . Though significant progress has been made recently, severe diffi- 
culties still remain, among which decoherence is certainly the most important. 
Large systems consisting of many qubits must be prepared in fragile quantum 
states, which are rapidly destroyed by their unavoidable coupling to the en- 
vironment. Measurement-based feedback and coherent feedback are possible 
routes towards the preparation, protection and stabilization of such states. For 
coherent feedback strategy, the controller is also a quantum system coupled to 
the original one (see [8j [10] and the references herein) . This paper is devoted 
to measurement-based feedback where the controller and the control input are 
classical objects [3U]- The results presented here are directly inspired by a re- 
cent experiment [161 115] demonstrating that such a quantum feedback scheme 
achieves the on-demand preparation and stabilization of non-classical states of 
a microwave field. 

Following [7 and relying on continuous-time Lyapunov techniques exploited 
in [T2], an initial measurement-based feedback was proposed in [BJ. This feed- 
back scheme stabilizes photon-number states (Fock states) of a microwave field 
(see e.g. [S] for a physical description of such cavity quantum electro-dynamics 
(CQED) systems). The controller involves firstly a quantum filter estimat- 
ing the state of the field from discrete-time measurements performed by probe 
atoms, and secondly a stabilizing state-feedback control relying on Lyapunov 
techniques. The discrete-time behavior was crucial for a possible real-time im- 
plementation of such kind of controllers. Closed-loop simulations reported in JBJ 
have been confirmed by the stability analysis performed in [2]. In the exper- 
imental implementation [16j [15], the state- feedback has been improved by a 
better choice of Lyapunov function. The goal of this paper is to present, for 
a class of discrete-time quantum systems, the mathematical methods undcrly- 
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ing such improved Lyapunov design. Our main result is given in Theorem 13.21 
where closed-loop convergence is proved in presence of delays and measurement 
imperfections. 

The state-feedback scheme may be applied to generic discrete-time finite- 
dimension quantum systems using controlled measurements in order to deter- 
ministically prepare and stabilize the system at some pre-specified goal state. 
These systems are governed by nonlinear controlled Markov processes where 
the jumps are measured and the jump probabilities are state-dependent. These 
systems are subject to a discrete-time sequence of positive operator valued mea- 
sures (POVMs 01]) and we use these POVMs to stabilize the system at the 
goal state. By controlled measurements, we mean that at each time step the 
chosen POVM is not fixed but is a function of some classical control signal u, 
similarly to |19) . We however assume that when the control u is zero, the cho- 
sen POVM performs a quantum non-demolition (QND) measurement [HI [20] for 
some orthonormal basis that includes the goal state. The feedback-law is based 
on a set of Lyapunov functions made of linear combination of the martingales 
attached to these QND measurements and that determines a "distance" be- 
tween the goal state and the current state. The parameters of these Lyapunov 
functions are given by inverting Mctzler matrices characterizing the impact of 
the control input on the Kraus operators defining the Markov processes and 
POVMs. The (graph theoretic) properties of the Metzler matrices are used 
to construct families of open-loop supermartingales that become strict super- 
martingales in closed-loop. This fact provides directly the convergence to the 
goal state without using invariance principle. 

A common problem that occurs in quantum feedback control is that of delays 
between the measurement process and the control process [14] . In this paper 
we demonstrate, using a predictive quantum filter, that the proposed scheme 
works even in the presence of delays. Convergence analysis is done for perfect 
and imperfect measurements. For imperfect measurements, the dynamics of the 
system are governed by a nonlinear Markovian chain given in [17) . In both the 
perfect and imperfect measurement situations we recover the quantum separa- 
tion principle as originally stated in [5]: if the quantum filter associated with 
the measurement process converges to the goal (pure) state then so does the 
actual state of the system. This separation principle implies that the feedback 
may be designed only knowing the state of the quantum filter and measurement 
outcomes in spite of not knowing the actual state of the system due to the 
imperfections in measurement. 

The paper is organized as follows. In section [21 we describe the finite di- 
mensional Markovian model together with the main modeling assumptions in 
the case of perfect measurements. Subsection 12.21 is devoted to the open- loop 
behavior (Theorem 12. ip which can be seen as a non-deterministic protocol for 
preparing a finite number of isolated and orthogonal quantum states. In subsec- 
tion [231 we present the main ideas underlying the construction of these control- 
Lyapunov functions W t : a Metzler matrix attached to the second derivative of 
the measurement operators and a technical lemma assuming this Meztler matrix 
is irreducible. Finally Theorem 12.21 describes the stabilizing feedback derived 
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from W e . The same analysis is done for the case of imperfect measurements in 
Section [3] We propose a brief discussion on the considered quantum filter and 
by proving a rather general separation principle (theorems 12.31 (resp. 13.2] )) for 
perfect measurements (resp. for imperfect measurements)). Sectionals devoted 
to the experimental implementation that has been done at Laboratoire Kastler- 
Brossel at Ecole Normale Superieure. Closed-loop simulations and experimental 
data complementary to those reported in [161 115] are presented. 

2 Perfect measurements 
2.1 The nonlinear Markov model 

We consider a finite dimensional quantum system (the underlying Hilbert space 
H = C d is of dimension d > 0) being measured through a generalized measure- 
ment procedure at discrete time intervals. The dynamics of such discrete time 
quantum systems are described by a nonlinear controlled Markov chain with a 
structure derived from quantum physics. We just sketch here this structure from 
a mathematical viewpoint. In this Section, we suppose perfect measurements 
which means that the detectors are ideal and also that there is no decoherence. 

The system state is described by a density operator p belonging to T> the set 
of positive, Hermitian matrices of trace one: 

V:={peC dxd \ p = p\ Tr(p) = l, p > 0}. 

To each measurement outcome p G {1, . . . ,m}, m > being the numbers of 
possible outcomes, is attached the Kraus operator S <£dxd Spending on p 
and also on a scalar control input u £ R. For each it, these Kraus operators 
(M^^gj! satisfy the constraint J2™=i M% = I where I is the identity 
matrix. They are attached to the Kraus map K" defined by 

m 

V3p^ K u (p) = ]T MlpMf e V. (1) 

The random evolution of the state Pk € T> at time step k is modeled through 
the following Markov process: 

Ph+1 =M% r r(j> k ), (2) 

where 

• itfc_ T is the control at step fc, subject to a delay of r > steps. This delay 
is usually due to delays in the measurement process that can also be seen 
as delays in the control process, 

• pk is a random variable taking values p in {1, • • • , m} with probability 

PZ7 = r ft(A^*- T p fc A^*- Tt ), 
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• for each p, MJJ is the super-operator 

M u ■ n i ^ M > M ^" p T> 

^ ^ Tr(M-pM ?t ) 6 P 

defined for p G X> such that p£ p = Tr hd^pM^ ^ 0. 

We state below three assumptions that we will use in the paper. 

Assumption 1. For u = 0, the measurement operators Mf] are diagonal in the 

same orthonormal basis { \n) \ n € {1, • • • , d}}, therefore = J^ n _ 1 c^.n |rt) (n\ 
with c^ n G C. 

Assumption 2. For aZ/ ni ^ ri2 in {1, •■• , <i}, there exists p G {!,••• , Tn} 
such that \c^ ni \ 2 ^ \ c ^,n 2 \ 2 - 

Assumption 3. The measurement operators Af" are C 2 functions of u. 

Assumption Q] means that when the control is zero, the considered measure- 
ment process achieves a Quantum Non Demolition (QND) measurement for the 
physical observables given by projection operators over the states ||n) | n G 
{1, • ■ • This implies that for = 0, any p — \n) (n\ corresponding to the 

orthogonal projector on the basis vector \n), n G {1, ...,d}, is a fixed point of 
the Markov process ©. Since the operators M" must satisfy Y^u=i -^S = I; 
we have, according to assumption[TJ that 53u=i I c m,«| 2 = 1 for all n G {1, • • • , d}. 

Assumption [5] means that there exists a p such that the statistics when 
Uk = for obtaining the measurement result p are different for the fixed points 
|ni)(ni| and \n 2 ) (n 2 |. This follows by noting that Tr \ M® \n) (n\ M®^ 
\c^ n \ 2 for n G {1, • • • ,d}. 



2.2 Convergence of the open-loop dynamics 

When the control input vanishes (it = 0), the dynamics are simply given by 

Pfe+1 =M°> fc ) (3) 

where pk is a random variable with values in {1, ... , m}. The probability p° Pk to 
have pk = p depends on p^: P°. Pfc = Tr (^M^pkM^^j . The open-loop asymptotic 
behavior is characterized by the following theorem. 

Theorem 2.1. Consider a Markov process pk obeying the dynamics of ([3]) with 
an initial condition po in T>. Then 

• with probability one, pk converges to one of the d states \n) (n\ with n G 

• the probability of convergence towards the state \n) (n\ depends only on the 
initial condition po and is given by (n\ po \n) . 
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This theorem is already proved in [2, 3 and also in [4 in a slightly differ- 
ent formulation. We recall the proof proposed in [3] since it uses a Lyapunov 
function that will be exploited in closed-loop. 

Proof. For any n £ {1, . . . , d}, (n\ p \n) is a martingale. This results from 

m 

E «n| Pk+l \n) \ Pk ) = £ Tr (Kp* M f) ( n \ M °M) N 



= J2 h Kp*< i«> = H E K M > i«) = («i p* i") . 
i i 

where we have used that M® and |n) (n| commute and Y^u=i = I. 

Take the function 

nrti= _^(M^>)!. (4) 

n=l 

The function 1/ being concave and each (n| p |n) being a martingale, we infer 
that V(p) is a supermartingale 

E(v/(p fc+1 )| Pfe )<y(p fe ). 

More precisely, we have 

0t\ / (n\M°p k M° r \n) 



with S° fc = | /X e {1, • • • , m} | Tr (M°p fc M° t ) ^ o|. We have the identity 



Tt 



(MOp fc AfOt) 



where we have used E M£ s° fc Tr (M°p k M°^ = 1. 

The above identity yields E (V(pfc+i)l/°fc) — V{pk) = —Q(pk) with 

w = 2 E E ^(^^>(^/^Vfe^^-^^) a 

We have used that (n\ M° (p) |n) = ^jggj and ("I MjpJ< |n) = 

(n|p|n) . Since Q > 0, V^(pfc) is a supermartingale. Note that the sum in the 



G 



definition of Q(p) is over all fi, v € S®. However, we can assume that the sum 
is actually over all p,, v in {1, • • • , m} by observing the following facts. For any 
p, v, take the mapping 



p h. Tr (M°pM° f ) Tr (M°pM° 



Ot^ f \c \ 2 (n\p\n) _ |c | 2 (n|p|n) 
^(A/Op^/ot) Tr(Af£pA/° + ) 



defined only when Tr (M^pM^ J Tr [M^pM^ J > 0. Since this mapping is pos 
itive and bounded by p ^ Tr ( M°pM^J Tr (M°pA/° f ) , it can be extended by 
continuity to any p € V by taking a null value when Tr (M°pAf° f ) Tr ^M°pM° t 



dm , ^ 2 



0. Thus 

dm , 

_ 1 rp. / »^0_»^-0t\ m. / »^0„,, r 0t\ / Jc 

^(AfOpMjJt) Ti^A/gpA/gt) 

n=l fi : v= 1 ^ 

(5) 

is continuously defined for any pel? and still satisfies 

E(V(p fe+1 )|p fe ) - y(p fe ) = -Q(p fc ),Vp fe e p. 

By Theorem IA.1I of the Appendix, the w-limit set is a subset of {p e 
2?| Q(p) = 0}. The condition Q — implies that, for all n, p, 

Tr (M°pM° f ) (n| M°pM° f |n) = Tr (M>M° f ) (n| M°pM° f |n). 

Taking the sum over all v, we get for all n and p, 

(n\ M>M° f |n) = Tr (M°pM° f ) (n| p |n> 

These relations read {p n ',n' = (n'\ p \n')) 

|Cp,n| Pn,n — ^ jCp^n'l Pn',n'^ Pn,n- (6) 

Since Tr(p) = 1 and p > 0, there exists n* € {1, ...,d} such that p n *, n * > 
0. Thus ((6]) with arbitrary p and n = n* gives |c Mi „*| 2 = J^ n , |c Min '| 2 p„' !n '. 
Consequently for all p and n, (j6|) reads 

(\c^,n\ 2 ~ \c fi ,„*\ 2 )p„, n = 0. 

By assumption[2j p„.„ = as soon as n ^ n* and thus p = |n*) (n*\. This closes 
the proof of the first assertion. 

We have shown that the probability measure associated to the random vari- 
able pk converges weakly to the probability measure Yln=i Pn^\n)(n\ > where 
^|Ti)(n| denotes the Dirac measure at \n) (n\ and p n is the probability of con- 
vergence towards |n) (n| . In particular, we have E ((n| pk \n)) — > p n . But 
(n| pk \n) is a martingale thus E {{n\ pk \n)) = E((n|po|n)) and consequently 
Pn = (n\ po \n). □ 



7 



2.3 Feedback stabilization 



2.3.1 Design of the control- Lyapunov function 

Theorem l2 . 1 1 means that the open-loop dynamics is stable in the sense that each 
realization pk converges to one of the pure state |n) (n\ where n e {l,...,d} 
is stochastic with probability equals to (n| po \n) since (n\ p \n) is a martingale. 
The control goal is to made this convergence deterministic toward a chosen 
n E {l,...,d} playing the role of controller set point. The control design is 
based on Lyapunov techniques exploiting the d open- loop martingales (n\ p \ n). 
In [1 [2] we take 

1 - {n\ p \n) = ( n \ P \ n ) 

as control Lyapunov function and propose a state feedback law transforming 
this open- loop martingale into a closed-loop supermartingale. In [3j [18], the 
feedback relies on Lyapunov functions Vo(p) of the form 

d 

Vo(p) =^2(T n (n\p\n) 

n=l 

where the real coefficients a n are strictly positive except er fi = 0. Since the a n are 
all different in general, such Lyapunov functions achieve a better discrimination 
between the open- loop fixed points |n) {n\: a large a n indicates that the control 
effort needed to steer \n) (n\ towards \n) (n\ will be important. We generalize 
here this Lyapunov design for arbitrary controlled Kraus operators that 
cannot be decomposed into QND measurement operators M° followed by a 
unique controlled unitary operator D u with Dq — I as assumed in [3] I18j where 

i)„m;;. 

More precisely, these positive coefficients have to ensure that for any n ^ n, 
the second-order w-derivative of Vb(lK u (|n) (n|)) at u = is strictly negative (K u 
is the Kraus map defined by (HJ). This yields to a set of linear equations in a n 
that can be solved by inverting an irreducible Metzler matrix (see lemma [272]) . 
Once the cr n 's satisfy these constraints, the stabilizing feedback law proposed in 
theorem 12.21 is based on the open-loop supermartingale 

d 

K(p) = K,(p)-f^((n|p|n)) 2 

n=l 

where the sum of quadratic terms has been previously used as supermartingale 
during the proof of the open-loop convergence addressed by theorem 12.11 For 
e > small enough, V e (p) still admits a unique global minimum at p = |n) (n|; 
for u close to 0, u H V e (K u (\n) (n\)) is strongly concave for any n ^ n and 
strongly convex for n = n. This indicates that V e is a good candidate as control- 
Lyapunov function. 

It remains to explain how to take into account the delay r explicitly into the 
Lyapunov design relying of V e (p). Take {pk, u/c-i, ■ ■ • , u/c-r) as state at step k. 
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More precisely denote by x = G°> Pi, ■ • • > Pt) this state where 0i stands for the 
control input u delayed I steps. Then the state form of the delay dynamics ([2} 
is governed by the following Markov chain 



Pk+i 



P2,fc+1 = Pl,k 
, PT,fc+l = Pr—\,k' 



(7) 



The goal is to design a feedback law u& = f(xk) that globally stabilizes this 
Markov chain towards a chosen target state x = (1^) (^| > 0, • • • , 0) for some 
n £ {1, • • • , d}. In theorem [2?2] we show how to design a feedback relying on the 

control Lyapunov function W e (x) = V t {K f3l {K^{ K^(p).. .)))• 

The construction of Vo(p) relies on the following lemmas. 



Lemma 2.1. Consider the d x d Metzler matrix R defined by 

i 



Rni ,r. 



A* 



"2 



2(5 nii „ 2 3?(( 



H,ni 



"1 



1^2) 



When R^= 0, the non negative P = I R/Tr(R) is a right stochastic matrix. 

Proof. For m =/= ri2, R ni ,n 2 0. Thus i? is a Metzler matrix. Let us prove 
that the sum of each row vanishes. This results from identity MJf'M™ = I. 
Deriving twice versus u the relation 

1 = Y, (m\MfMZ |m) = ]T (m| Mf |n 2 ) (n 2 | |m) 



yields 



P-,n 2 



m 



|n 2 ) (n 2 | 



du 



du 



m) + <m|^^|n 2 ) (najA^K) 



+ (m|M«t|n 2 ) (n 2 |^S|m} = 



for u = 0, 



E 



„ 2 Rn u n 2 ■ 



(n 2 \M°\ni) = 5 ni ,„,c, 



the above sum corresponds to 



Therefore, the diagonal elements of R are non positive. If R ^ 0, 
then Tr (i?) < and the matrix P = I — i?/Tr (i?) is well defined with non- 
negative entries. Since the sum of each row of R vanished, the sum of each row 
of P is equal to 1. Thus P is a right stochastic matrix. □ 



Lemma 2.2. Assume the graph of the matrix R defined in Lemma \2.1\ is strongly 
connected, i.e. for any n,n' £ {1, . . . ,d}, n ^ n' , there exists a chain of r 
distinct elements (nj)j—i >r of {1, . . . , d} such that m = n, n r — n' and for 
any j = 1, . . . r — 1, R nj ,n j+1 ^ 0. Take n £ {1, . . . , d}. Then exist d— 1 strictly 
positive real numbers e n , n £ {1, . . . ,d}/{n}, such that 
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for any reals X n , n G {1, . . . , d}/{n}, there exists a unique vector a — 
(°Vi)ne{i, ■ ■■,d} of R d with an — such that Ra — A where A is the vector 
ofM. d of components A„ for n G {1, . . . , d}/{fi} and A ft = — J2n^n e n^ni 
if additionally A„ < for all n G {1, . . . , d}/{fi}, then a n > for all 
n G {l,...,d}/{n}. 



• for any vector a G M. d , solution of Ra = A G 



the function Vo(p) 



En=i Vn{n\p \n) satisfies 

d 2 V (K u {\n) (n|)) 



du 2 



Vn G {l,...,d}. 



Proof. Since the graph of R coincides with the graph of the right stochastic 
matrix P defined in Lemma 12.11 P is irreducible. Since it is a right stochastic 
matrix, its spectral radius is equal to 1. By Perron- Frobenius theorem for 
non- negative irreducible matrices, this spectral radius, i.e. 1, is also an eigen- 
value of P and of P T , with multiplicity one and associated to eigen- vectors 
having strictly positive entries: the right eigen-vector (Pw = w) is obviously 
w = (1, . . . , 1) T ; the left eigen-vector e = (ex, ... , e„) T (P T e = e) can be chosen 
such that en = 1. Consequently, the rank of R is d — 1 with ker(i?) = span(w) 
and Im(i?) = e 1 - where e 1 - is the hyper-plane orthogonal to e. Since e T A = 0, 
A G Im(i?), exists a such that Ra = A. Since ker(i?) = span(u>), there is a 
unique a solution of Ra = A such that a n = 0. The fact that a n > a n when 
A„ > for n ^ n, comes from elementary manipulations of Pa = a — A/Tr (R) 
showing that min„^ fi a n > cr fi . 

For any n, set = V (K u (\n) (n|)) = £, <n (l\^ u (\n) (n\) \l) . Set P? := 
(l\K u (\n) (n\) \l) , we have 



du 



= E ^? i") ("i M ; f io + w k n h ^ io 



and 



-Q=E(<^L=ol«)HM° t |0 



+ <n 3?U 1") («i ^L= io + <*i K i»> H ^L= io 

2 



=£ 2 



W^LolO 



+ 2<5 ni 3J(c M ,„ (n|^| u=0 |0) 

d 2 y (K"(|«)(n|)) 



Therefore ^p-|„=o = i?ru and 



□ 



u=0 
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2.3.2 The global stabilizing feedback 

Theorem 2.2. Consider the Markov chain ([7]) with assumptions^ [H and\^ 
Take n £ {1, ...,d} and assume that the graph of the Metzler matrix R of 
lemma [Ql is strongly connected. Take e > 0, a £ the solution of Ra = 
A with a n = 0, X n < for n e {1, . . . ,d} /{n}, \ n = -X) n ^ ft e n A„ (see 
lemma WJfy and consider 

d 

V e {p) = ^n{n\p\n)-^{n\p\n)f. 

71=1 

Take u > and consider the following feedback law 

u k = argmin(E(W<:(xk+i)\Xk,u k = £))=: f( Xk ) (8) 

where iy e (x) = K(K ft (K & ( K^(p) . . .))). 

XTien exists u* > swc/i i/ia£, /or ai/ u e]0, u*\ and e G 0, min n ^ft ( IT 22- ) > 
i/ie closed loop Markov chain of state Xk with the feedback law (J5J converges al- 
most surely towards (|n) (n| ,0, • • ■ ,0) /or anj/ initial condition xo € Px [— u, m] t . 



Proof. For the sake of simplicity, first we demonstrate this theorem for r = 1 
and thus for x = iPiPl)- We then explain how this proof may be extended to 
arbitrary r > 1. 
We have 



E(W £ (xfc+i)|Xfc:Wife) =^p^V;(K Mfc (M^ 1,fc (p fe ))) . 



Thus E(W e (xk + i)\Xk,u k ) = -Qi(Xfe) + E^P^(K°(M^(p fc ))) where 



wo 



have set 



Ql(Xfc)=E^MK (M^> fe )))- min V^t/ £ (K«(M^' fc (p fc ))) (9) 



f6[-u,u] 



and we clearly have Qi > 0. 

Using V e (K°(p)) = V e {p) for any p e V and E M ^^(]K (M^' fc (p fe ))) 
^(K^(p fe ))-Q 2 (Xfe) with 



n— 1 /i,z^ 



H MfrpM^ \n) (nj M^pM^ \n) 
Tr (M^ 1 pM^ 1 f ) Tr (m^ 1 pM^ 1 1 



(10) 



we have 

E (W e (x*+i)|x*. «k) = ^e(Xfe) - Qi(Xfe) - Q2(Xk) (11) 
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where the continuous and non- negative functions Q\ and Qi are defined by (O 
and (|10p . W e (xk) is thus a supermartingale. According to Theorem IA.11 the 
w-limit set 1^ , of \k satisfies 



/»c{(p,ft)eDx [-u,u] I Qi(p,/3i) = 0, Q 2 (p,/3i) = 0}. 



We now show in three steps that loo only consists of the target Fock state. 
Step 1. For all 5 > 0, there exists u > small enough such that 



Proof of step 1. Step 1 follows from Q%{p) = for all (p, /3i) € loo and 
Lemma 12.31 

Lemma 2.3. Consider the function Q2 defined by (|10[) and u > 0. TTien i/iere 
exists C > smc/i i/iai /or a// {p, Pi) G I? x [— u, u] satisfying Q2(p,/3i) = 0, 
£/iere exists n € {1, . . . , d} swc/i £/ia£ p n) „ = (n| p |n) > 1 — C|/3i|. 

A proof of this technical lemma is given in the appendix. 
Step 2. There exist 8' > and u > 0, small enough, such that 



Pi is continuous from Assumption [3] Therefore, there exist S',u > such that 




n=l 



loo C {(p, Pi) eV x [-u, u] I (n|p|n) > 1 - 6'}. 



Proof of step 2. If Qi(p,Pi) = 0, the minimum of the function 



{-u,u} 3 £ ^ F,, ft (0 = XX^OK^M^P))) 




for all 



(12) 



we have 



d£ 2 



< 0. 
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Therefore, for all (p,/3i) satisfying (fT2"j). £ = 0, can not minimize F Pt p 1 (£) and 
we have 

/a>n[J {(p,j9i) G P x [-u,u] I (n|p|n) > 1 - 5'} = 0. 

Therefore we get, 

loo C {(p, /Si) G X> x [-«, u] | (R|p|n) > 1 - ^'}, 

by setting 5 = 5' in Step 1, for u small enough. This finishes the proof of Step 
2. 

Step 3. u can be chosen small enough so that = {(|rz) (n| ,0)}. 

Proof of step 3. By construction of V e , we have for p — \n) (n\ and /3i = 0, 



^ 2 Jln)(n|,o(£) 



> 0. 



By continuity of d 2 F Pi/ 3 1 /d^ 2 with respect to p and fix, we can choose 5 and u 
small enough such that for all j3\ G [— u, u] and p satisfying (n|p|n) > 1 — 5 we 
have 



> o. 



<2£ 2 

This implies that on the domain 

{(p, G V x [-u, u] 2 | (n|p|n) > 1 - 8}, 

F is a uniformly strongly convex function of £ G [— u, u]. Thus the argument of 
its minimum over £ G [— u, u] is a continuous function of p and /3i. 

Now we choose 5" = min{<5/2, <5'/2}, where 5' is as in step 2. Then, taking 
a convergent subsequence of {xfcj^Li (that we still denote by Xfc for simplicity 
sakes), we have 

Xk — > (Poo, j8i,oo) G loo C {(p, ftjePx [-it, ft] I (ra|p|ra) >1-S}. 
Therefore, 

w fc = argminF Pfci(gi fc (£) -> argminF Poo i(8l >oo (£) = 0. 

In the above, we get the last equality from the fact that Qi(poo> A.,oo) = 0. 

Since /^.j. = u^-i, fi\k tends almost surely towards 0. We know that 
Q-z{Pkifix,k) and Qx{pkifix,k) tend also almost surely to (theorem IA. 1[) and 
by uniform (in p) continuity with respect to /3\ t k, Q2(p/c,0) and Qx{pk,0) tend 
almost surely to 0. Since Qi(p, 0) = Q%{p,Q) — implies p = \n) (h\, p^ con- 
verges almost surely towards \n) {n\. This completes the proof for r = 1. 
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Extension to t > 1. For r > 1 and \ = (p> Pit ■ ■ ■ > Pr), the proof is very 
similar. We still have (JTTJ) with Qi and Q2 given by formulae analogous to © 

and (TO]): /3i, fe is replaced by /3 T!fc ; M^' fc (> fe ) is replaced by K^* . . . K^- 1 .* (M^ T ' fc (p k )); 
vl T ,p h =Tr \^M^ k p k M^\ is replaced by Tr ...K^-^(M^- k p k M^)^ 

(Kraus maps are trace preserving). The analogue of technical Lemma[2]3] whose proof 
relies on similar continuity and compactness arguments, has now the following state- 
ment: 

Lemma 2.4. There exists C > such that for all (p, p\, . . . , /3 T ) 6 T> x [— u, u] T 
satisfying Qz(p, /3i, . . . , fi T ) = 0, there exists n G {l,...,d} such that p n:n > 1 — 
67(1/3x1 + |/3 2 | + ... + |/? T |). 

The last part of the proof showing that, for u small enough, the control Uk is a 
continuous function of \k when Xfe is in the neighborhood of the u;-limit set 

{xevx [-u,u] T I Qi(x) = Q 2 (x) = 0} 

remains almost the same. □ 



2.3.3 Quantum filter and separation principle 

When the measurement process is fully efficient and admits no error, the Markov 
process (J2) represents a natural choice for estimating the hidden state p. Indeed, 
the estimate /o oat of p satisfies the following dynamics 

f%-i=Kl- T 0>?) (is) 

where the measurement outcome pk is driven by ©. In practice, the control Uk 
defined in Theorem 12.21 could only depend on this estimation p% Bt replacing p k 
in Xk- If Po * = Po, then p° k Bt = pk and Theorem 1 2 . 21 ensures convergence towards 
the target state. Otherwise, the following result guaranties the convergence of 
such observer/controller scheme when ker(pg st ) C ker(po)- 

Theorem 2.3. Consider the Markov chain of state p k obeying @ and assume 
that the assumptions of Theorem \2/£\ are satisfied. For each measurement out- 
come pk given by p]). consider the estimation p k st given by (| 13() with an initial 
condition Pq* . Set Uk = f{p%\ u k-i, ■ ■ ■ , u k - T ) where f is given by (|8|). Then 
there exists u* > such that, for all u £]0,u*] and e € {),ia\ii n ^n ^ , 

Pk and plf converge almost surely towards the target state \n) (fi\ as soon as 
ker(p5 sf ) C ker(po)- 

Proof. Set p = \n) (n\. The main idea is that E (Tr (pkp~) \po, Po ') (where we take 
the expectation over all jump realizations) depends linearly on p even though 
we apply the feedback control. 

Since u k ^ T is a function of (p st , po, ■ ■ ■ , pk-T-i), we have 



E (Tr ( Pk p) \p , tf) = J2 Tr ( M w-T ( ' • • ( M mV (Po)))P 
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where M%(p) = M^pAlf. The linearity of E(Tr(p feJ o) |p ,P5 3t ) with respect 
to po is thus verified. Now we apply the assumption ker(p5 Bt ) C ker(po) and 
therefore one can find a constant 7 g]0, 1[ and a well-defined density matrix p$ 
in T>, such that 

PT = 7Po + (1 - 7)Po- 

By the assumptions of the theorem and applying the dominated convergence 
theorem: 

lim E(Tr (p k p) | ftf,ftf) = l. 

k— too 

By the linearity of E (Tr (ptp) | Po? PcT) with respect to po> w ^ have 

E (Tr (p k p) I Po\pT) = 7E (Tr (p k p) \ p , p^) + (1 - 7)E (Tr (p k p) \ p c , p^) , 

and as both E (Tr (p^p) | p 07 pQ st ) and E (Tr (p k p) | pg,/9Q st ) are less than or 
equal to one, we necessarily have that both of them converge to 1 since < 
7 < 1: 

hm E(Tr (p k p) \p ,p^) = l. 

K—>00 

This implies the almost sure convergence of p k towards the pure state p. 
We have also E (Tr (pl^p) \po, Po Bt ) depending linearly on p . Thus 

EC&OCP) I PT'PT) = 7 E ( Tr (pTp) I Po,Po st ) + (l-7)E(Tr( P rp) \po,pT), 

Using the fact that when po = pQ at , p k = p° k st for all fc, we conclude similarly 
that p k st converges almost surely towards p even if po and p^ do not coincide. 

□ 

3 Imperfect Measurements 

We now consider the feedback control problem in the presence of classical mea- 
surement imperfections with the possibility of detection errors. The imperfec- 
tions in the measurement process are described by a classical probabilistic model 
relying on a left stochastic matrix (77^ p! € {1, . . . , m'} and p G {1, . . . , m}: 

??ju',m > and for any p, X^u'=i VnW ~ ^- The integer vn! corresponds to the 
number of imperfect outcomes and is the probability of having the imper- 
fect outcome p! knowing that the perfect one is p. 
Set 

p k =E(/? fc |po,/^--->/4-i) u -r,---,Wfc- T _i) ■ (14) 
Since p k follows @ , p k is also governed by a Markov process [IT] : 

Pk+i=l^-{pk)i (15) 

where 
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• for each p! , L^, is the super-operator defined by 




with L-,(p) = E™ =1 ^',^M-pMf, 
• is a random variable taking values p! in {1, • • • , to'} with probability 

Since f^',^ is a left stochastic matrix, we have, 

m m 

e (p fe+1 ia = p, Uk _ T = u) = J2 K' (p) = E KpK ] = ku (p)> 

fl' — l 11— 1 

which is precisely the Kraus map ([T]) associated with the Markov process pk- 
By assumption [TJ each pure state \n) (n\, n £ {1, . . . , d} remains a fixed point 
of the Markov process (|T5)) when u = 0. 

We now consider the dynamics of the filter state p in the presence of delays 
in the feedback control. Similar to the case with perfect measurements, let 
Xk = {pkiPi.k, ■ ■ ■ ,/?T,fc) be the filter state at step k, where Pi.k = Uk-i is the 
feedback control at time step k delayed I steps. Then the delay dynamics are 
determined by the following Markov chain 

Pk+i -Lj' fc (p fc ) 
Pi,k+i = Uk 
< fc,k+i =Pi,k (16) 

, Pr,k+1 = Pr-l,fc- 

Instead of Assumption [2] we now assume 

Assumption 4. For all n\ ^ 122 in {1, • • • ,d}, there exists p! £ {1, • • • , m'} 
such that 

m m 

^ t 11n',fj,\ c IJ,,ni I 7^ y t Vfi' ,fj.\ c n,n 2 I ■ 
^l— 1 /I— 1 

Assumption|3]means that there exists a p! such that the statistics when u = 
for obtaining the measurement result // are different for the fixed points \n\) (ni| 
and |n 2 ) (n 2 |. This follows by noting that Tr (Ly (n|)) = X)™ =1 V,i«l c M,n| 2 
for n € {1, • • • , d}. 

We now state the main result of this Section, which is the analogue of The- 
orem [52] in the case of imperfect measurements. 
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Theorem 3.1. Consider the Markov chain (|T51) with assumptions^^ and^Q 
Take n € {1, . . . , d}. Assume that the graph of Meztler matrix R of Lemma \2.1\ 
is strongly connected. Take e > 0, a G solution of Ra = X with = 0, , 
A„ < for n G {1, . . . ,d}/{h}, A ft = — J2 n ^n e n^n (see Lemma [2. 2\) and set 

d 

V t {p)^Y. (J r l {n\p\n)-^{n\p\n)f. 

n=l 

Take u > and consider the following feedback law 

u k = argmin(E (W e (xk+i)\Xk,u k =£,))= f(Xk) (17) 

where W £ {x) = V e (K ft (K & ( K^(p) .. .))). 

Then there exists u* > such that, for all u e]0, U*] and e € 0, min,^^ (^ r 71 ) 

the closed loop Markov chain of state Xk with the feedback law (fT7|) converges al- 
most surely towards (\fi) (n| ,0, • • • ,0) for any initial condition xo£Px [— u, u] T . 



The proof of this theorem is almost identical to that of Theorem 12.21 with 
replaced by L^, and p^ p replaced by p™, ~ and we do not give the details of 
this proof. 

For estimating the hidden state p necessary to apply the feedback ([T71) , let 
us consider the estimate p" 3 * given by 

fir +1 =i$- T (8r) as) 

where p! k corresponds to the imperfect outcome detected at step k. Such p' k 
is correlated to the perfect and hidden outcome p k of (JH) through the classical 
stochastic process attached to (?y M ' lAI ): for each p k , p' k is a random variable to 
be equal to p' € {1, . . . , m'} with probability ?7 M ' iMfe . 

In practice, the control u k defined in Theorem 13 . 1 1 could only depend on this 
estimation pl st replacing p k in Xk — (pk,u k -i, ■ • ■ , u k - T ). The following result 
guaranties the convergence of the feedback scheme when ker(p^ Bt ) C ker(po)- 

Theorem 3.2. Consider the Markov chain of state p k obeying ^ and take 
assumptions of Theorem \3.1\ Consider the estimation pu* given by (|18|) with an 
initial condition p at . Set u k = f(p k 3t ,Uk~i, ■ ■ ■ ,u k - T ) where f is given by (ITTl) . 

Then there exists u* > such that, for all u g]0, u*\ and e £ 0, min n ^ ri ^ 

Pk and p k * converge almost surely towards the target state \n) (n\ as soon as 
ker(p ") C ker(p ). 



Proof. Let us first prove that p k defined by (fl4| converges almost surely towards 
|n) (n|. Since job = po, we have ker(pg st ) C ker(po)- Thus, there exist p c £ V and 
7 e]0, 1[, such that, jog* = jp + (l — 7) / 5 \ Similarly to the proof of Theorem l2.31 
E (Tr (pup) \pa, Po*) depends linearly on po : 

E (Tr (p k p) | , ft*) = 7 E (Tr (p k p) \ p , ft*) + (1 - 7 )E (Tr (p k p) \ ft, ft st ) , 
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Moreover by Theorem 13.11 E (Tr (pkp) | pg^PcT) converges almost surely to- 
wards 1, thus E(Tr (pkp) | Po,Po Bt ) and E(Tr(pfep) | /5§,/5jj at ) converge also to- 
wards 1. Consequently p^ converges almost surely towards \n) (n|. 

Since pk is the conditional expectation of pk knowing the past imperfect out- 
comes and control inputs and since its limit \n) (n\ is a pure state, pk converges 
necessarily towards the same pure state almost surely Convergence of p^T* relies 
on similar arguments exploiting the linearity of E (Tr (p% at p) \po, Po") versus po- 

□ 



4 The photon box 

4.1 Experimental system 




Figure 1: Scheme of the experimental setup. 



The system to be controlled is a microwave electromagnetic field trapped in 
a cavity C (see figure [T|) , made of two high- reflection mirrors facing each other 
and able to store photons for almost T cav ~ 0.1s. When prepared in a classical 
state, namely a coherent state \a), where a € C denotes the amplitude of the 
field, the number of photons in the field is not well defined: the photon number 
probability distribution P(n p y 1 ) is a Poisson law of parameter (n p h) = |a| 2 . 
Coherent states are robust states, that remain coherent while their energy (n p h) 
decays with a characteristic time T caN . The non-classical photon number states 
\n = ?iph + 1) = l^ph), or Fock states containing exactly n p h G N photons, are 
more fragile: a so-called quantum jump towards |rt p h — 1) occurs on average 
after time T cav /n p h only. Real-time information is then required to correct for 
these jumps and stabilize these states. 



18 



In our experiment, this information is provided by individual two-level atoms, 
whose states are denoted \g) and |e), and which are prepared in box B and sent 
through the cavity mode every time interval T p . In additional cavities Ri and 
R2, we can map these states to and from their quantum superposition which 
is sensitive to the field in C. While interacting with the cavity field, the atoms 
thus act as atomic clocks whose rate would be proportional to the number n p h of 
photons stored in the cavity: they accumulate a phase 4>(n p h) = </>o(™ph + 1/2), 
where 4>q is the dephasing per photon. Measuring each atom in the {\g} , |e)} 
basis by means of detector D gives partial information about n p h- Since the 
atomic preparation is not deterministic (Poisson process with a mean number 
of atoms per sample of (n a ) sa 0.6), zero, one or two atoms may be detected. 

A real-time computer then runs the quantum filter, which estimates the 
actual density matrix p of the field using the outcome of the measurement 
and all available knowledge on the experimental setup. Notably, it takes into 
account the limited detection efficiency e (percentage of atoms that are actually 
detected) and the detection errors r\ e and rj g {r]s^{e,g} is the probability to detect 
an atom in the wrong state). 

The controller eventually calculates through its feedback law the amplitude 
«eR (scalar control input) of the classical microwave field that is injected into 
the cavity by the classical source S so as to bring the cavity field closer to the 
target state. The succession of the atomic detection, the state estimation and 
the microwave injection define one iteration of our feedback loop. For a more 
detailed description see [16j El H] ■ 

4.2 The controlled Markov process and quantum filter 

The three main processes that drive the evolution of our system are decoher- 
ence, injection and measurement. Their action onto the system's state can 
be described by three super-operators T, ID) and P, respectively, which we dis- 
cuss in more details in the following 6 . All operators are expressed in the 
Fock-basis (|n p h))n ph =o,....ri™ h ax truncated to n™^ x photons. With notations of 
sub-section 12. 11 d = n 1 ^ + 1 and the index n = n p h + 1. 

Decoherence 

The decoherence manifests through spontaneous loss or capture of a photon to 
or from the environment. Thus, it can be sufficiently described by the action of 
the superoperator T in the justified approximation 9 = T p /T cav <til in the form 
of 

p h+ T g (p) = L oP L\ + L-pLl + L+pLl. (19) 

Here, the three operators Lq 1 L„ and L + refer to the situations where the 
photon number changes by 0, —1 and +1 after the time interval T p . They thus 
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read 



L = I - 0(1/2 + na.) N - (0n th /2) I, (20) 
L_ = v/0(l+n th )a, (21) 
L+ = y/e^a^, (22) 

where a and a* are photon annihilation and creation operators (a|n p h) = 
^/n P h |n p h— 1) and a* |n p h) = i/n p h + l |n p h + l)) and N = a' a is the photon 
number operator (N|n p h) = n p h |n p h))- Besides, nth is the mean number of 
photons in the cavity mode at thermal equilibrium with its environment. 



Injection 

The field control is realized by injecting into the cavity a small coherent mi- 
crowave field of real amplitude u e M. It is described by the unitary displace- 
ment operator D u = exp(ua^ — ua). The evolution of the state p after the 
control injection is thus modeled through 

p i y D u (p) = D uP D_ u . (23) 

Since all analysis is performed up to the second order of the control parameter 
u, we use the following approximation of the displaced state: 

n>„(p)« p-ub,at-a] + y[[p,at-a],at-a]. (24) 

In the following we therefore limit the control to u G [— u, u] with u — 0.1 <C 1. 
Besides, this approximation allows to significantly reduce the computation time 
during the real-time state filtering. 



Measurement 

In the ideal case, every detection of an atomic sample gives one of the two 
possible results p, € {e, g} and the system gets projected onto one of the two 
states given by 

M u pMl 

P» = r V (25) 
Tr (M.pMt) 

The operators M M depend on the experimental settings such as the relative 
phase (f) r between cavities Ri and R2 and the dephasing per photon <p : 

M a = cos( ^°<™ ), (26a) 
M« = sin (h±^±im . (26b) 



These measurement operators are diagonal in the photon number basis {|n p h)}, 
illustrating their quantum non-demolition nature with respect to the photon 



20 



n'\n 





9 


e 


99 


ee 


ge or eg 





1 


1-e 


1-e 


(1-e? 


(1-e) 2 


(1-e) 2 


9 









2e(l- e )(l- 77s ) 


2 £ (l- £ )r ?e 


e(l-£)(l-J7g+rj e ) 


e 









2e(l-e)r, 9 


2 £ (l-e)(l-r, e ) 


e(l-e)(l-tj e +%) 


99 











e 2 (l-%) 2 


^ 2 


e 2 ^e(l-»79) 


ge 













2e 2 Ve (l- Ve ) 


6 2 ((l-'7 S )(l-'7e)+»7 9 »7e) 


ee 











^ 9 2 







Table 1: Stochastic matrix rj^i^ showing the probability to measure outcome 
// for each ideal measurement outcome /z. Note that we cannot technically 
distinguish detection outcomes // = {ge} and \j! = {eg}. 



number operator and thus fulfilling Assumption 1. Besides, Assumption 2 can 
also be fulfilled by a proper choice of the experimental parameters 4> r and 4>o ■ 

In the real experiment, the atom source is probabilistic and is character- 
ized by a truncated Poisson probability distribution P a (n a ) > to have n a 6 
{0,1,2} atom(s) in a sample (we neglect events with more than 2 atoms). 
This expands the set of the possible detection outcomes to m — 7 values 
H € {0, g, e, gg, eg, ge, ee}, related to the following measurement operators: 



L = y/P a (0)I, (27) 

L g = y r Kil)M g , 

L e = ^P a (l)M e , 

L gg = ^[PMM 2 g , 

L ge = ^P a {2)M g M e , 

L eg = ^P a (2)M g M e , 

L ee = ^JP a {2)M 2 g . 

Finally, the real measurement process is not perfect: the detection efficiency 
is limited to e < 1 and the state detection errors are non-zero (0 < rj e / g < 1). 
These imperfections are taken into account by considering the left-stochastic 
matrix rj^^, given in Table [1] [17]. Consequently, the optimal state estimate 
after measurement outcome \j! gets the form 

State estimation 

The last experimental complication to be taken into account is the spatial sep- 
aration of the cavity C and the detector D, which results in r atomic samples 
flying between them at every moment. This introduces a delay in the field con- 
trol since a sample measured at step k has been in the cavity just after control 
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injection made at step k — r. After taking into account our full knowledge about 
the experiment, we finally get the following state estimate at step k + 1: 

= P^(B Ufe _ T (T,(^))) = l^- 6 (pt). (29) 
4.3 Feedback controller 

For = (no cavity decoherence) the Markov model of density matrix p associ- 
ated to the filter (|2"9")l is exactly of the form (TIB]) with (|ra p h) (n P h|)n ph =o,...,n™ ax 
being fixed-points in open-loop. Similarly, the underlying Markov process of 
state p, the true cavity state unobservable in practice because of detection im- 
perfections and delays, admits the same fixed-points. With parameters given in 
sub-section !4.4l (except 9 = 0), these Markov processes satisfy assumptions Q][5][3] 
for p and assumptions [TJ3J3] for p. Moreover the Metzler matrix R of Lemma |2~21 
is irreducible since its graph coincides with the graph of a^ a, a tri-diagonal op- 
erator. Consequently the assumptions of theorem l3.1l are satisfied. The feedback 
law proposed in theorem !3.2l and relying on the filter state p^ at will stabilize glob- 
ally the unobservable state p towards the goal photon- number state \n p h) (%h|- 
Numerous closed-loop simulations show that taking e = in the feedback law 
does not destroy stability and does not affect the convergence rates. This ex- 
plains why in the simulations and experiments, we set e = despite the fact 
that theorem 13.21 guaranties convergence only for arbitrary small but strictly 
positive e. 

For 9 positive and small, the |n p h) (^ P h|'s are no more fixed-points for p and 
p in open-loop. Nevertheless, closed-loop simulations and experimental data 
of figures [3] and [4] indicate that a slight adaptation of the feedback scheme of 
theorem !3. 21 steers and maintains p and p close to the target |n p h) (^ P h| between 
photon-number jumps induced by cavity decoherence (jump operators L + and 
L_ given in (I2U1) '). The feedback of theorem 13.21 appears to be robust enough to 
compensate such cavity decoherence jumps induced by a finite lifetime of the 
photons. 

Let us detail how to adapt the feedback scheme of theorem !3.2l At each step 
of an ideal experiment the control u k minimizes the Lyapunov function Vo(pk) = 
J2n ph CT ™ P h ( n phl Pk \ n ph) (e is set to zero) calculated for state p k = B Uk (p k ). In 
our real experiment however, we also take into account decoherence and r flying 
not-yet-detected samples and therefore choose u k to minimize Vo(pk) for 

p k = B Uk (T 9 (K^-^ e (K^-^ e (. . . K" fe — e G5T) • • •)))) 

with p" 33 * given by (f29|) . Here we have introduced the Kraus map of the real 
experiment 

m' rn 

V3 P ^ K u > e (p) =J2J2 V,'MB u (T e (p)))L^ e V. (30) 

fl' — l f-L— 1 
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Figure 2: Coefficients of the Lyapunov function Vo(p) used in the feedback for 
the simulations of figure [3] and for the experiment of figure [4] 



To simplify the minimization of Vq we use approximation (|24|) and get 

V (D u (p)) a VoGo) - (am + a 2 ^ (31) 

with ai = Tr([a^ — a, <tn]p) and a 2 — Tr([[a^ — a, on], a^ — a]p), where ctn is the 
diagonal operator J2 n h an ph l n ph) (%>h|- The coefficients er„ ph are computed 
using Lemma 12.21 where, for n p h ^ n P h, A„ ph are chosen negative and with 
a decreasing modulus versus n p h in order to compensate cavity decay. For 
"■ph = 3, we have compared in simulations different setting and selected the 
profile displayed in figure [2] The control u minimizing Vq is then approximated 

by 

argmax (a^ + a 2 £ 2 /2) . 
fe[-u,Tt] 



4.4 Simulations and experimental results 

Closed-loop simulation of figure [3] shows one typical Monte-Carlo trajectory of 
the feedback loop aiming on the stabilization of the 3-photon state |n p h = 3). 
The typical values of experimental parameters used in the simulations are the 
following: ra£g* = 8, <j) = 0.245 tt, <j) r = vr/2 - <j) (n ph + 1/2), (n a ) = 0.6, 
e = 0.35, r/ e = 0.13, i] g = 0.11, T p = 82 /is, T cav = 65 ms, nth = 0.05, and 
t = 4. For the feedback, cr„ ph are given in figure [5J e = and u = 1/10. The 
initial states po and pjj st coincide with the coherent state with n p h = 3 photons: 
D=-(|0)(0|). 



23 




20 40 60 80 100 120 140 160 

Time t (ms) 



Figure 3: Simulation of one Monte-Carlo trajectory in closed-loop with 
the target state |n p h = 3). (a) Detection results. (b) Control Lya- 
punov function, (c) Control input, (d) Estimated photon number prob- 
abilities: E, lph<fiph ( n phl P° st \ n ph) in red > ("phi P° st \n P h) in thick green, 
and Y^n b >n j ( n phl P° at \ n ph) i n blue. (e) Cavity photon number proba- 
bilities: E, lph<ftph (%>h| P l n ph) in red , (n ph \p\n ph ) in thick green, and 
E„ ph >r- lph M P Kh> in blue. 
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The results of the experimental implementation of the feedback scheme are 
presented in figure [4] figure QJe) shows that the average fidelity of the tar- 
get state is about 47%. Besides, the asymmetry between the distributions for 
riph < n p h and n p h > « P h indicates the presence of quantum jumps occurring 
preferentially downwards (n p h — ► nph — I). Contrarily to the simulations of 
figure [31 the cavity photon numbers relying of p are not accessible in the ex- 
perimental data of figure [4] since we do not have access to the true detections 
and to the cavity decoherence jumps. Nevertheless, green curves in figures E^d) 
andEUJe) indicate that when (n p h| p^ Bt \n p h) exceeds 8/10, p coincides, with high 
probability, with the goal state |n p h) (n p h\- 
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Figure 4: Single experimental trajectory of the feedback loop with the tar- 
get state |n p h = 3). (a) Detection results. (b) Evolution of the control 
function. (c) Control injection. (d) Estimated photon number probabil- 
ities: J2n ph <7i uh ("phi P° st \ n ph) in red , (n ph \ p^ 1 |n ph ) in thick green, and 



E 



(n p h\ p""* \n p h) in blue, (e) Estimated photon number probabilities 



•^"■ P h>Hph v-H"ii r-H"/ - ■ \ / - j. - - - j 

averaged over 4000 experimental trajectories of the feedback loop. 
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5 Conclusion 



Wc have proposed a Lyapunov design for state- feedback stabilization of a discrete- 
time finite-dimensional quantum system with QND measurements. Extensions 
of this design are possible in different directions such as 

• replacing the continuous and mono-dimensional input u by a multi-dimensional 
one (tti, . . . ,u p ); 

• assuming that u belongs to a finite set of discrete values; 

• taking an infinite dimensional state space as in |18) where the truncation 
to finite photon numbers is removed; 

• considering continuous-time systems similar to the ones investigated in [12] ; 

• ensuring convergence towards a sub-space instead of a pure-state and thus 
achieving a goal similar to error correction code as already proposed in pQ . 
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A Appendix 



Theorem A.l. Let X k be a Markov chain on the compact state space S. 
Suppose, there exists a non-negative function V(X) satisfying 

E(V(X k+1 )\X k ) - V(X k ) = -Q(X k ), (32) 

where Q(X) is a positive continuous function of X, then the w-limit set (in 
the sense of almost sure convergence) of X k is included in the following set 

Ioo = {X\ Q(X) = 0}. 



Proof. The proof is just an application of the Theorem 1 in [TTJ Ch. 8] , which 
shows that Q(X k ) converges to zero for almost all paths. It is clear that the 
continuity of Q(X) with respect to X and the compactness of S implies that 
the w-limit set of X k is necessarily included into the set 1^ . □ 



Proof of Lemma 12.31 The condition Q2 = implies that, for all n, p, v 

Tr (MppMf^ (n| M*pM* f \n) = Tr (M*pM* f ) (n\ MfrpM^ \n). 
Taking the sum over all v, we get, for all n and p, 

(n| M^pM^ \n) = Tr (m^ pM^) (n\ K^(p) \n) 
Since M^ 1 and K^ 1 are C 2 function of /3i, these relations read {p n ',n' — ( n '\p \ n ')) 



\c^,n\ 2 Pn,n = \ c fi,n' \* Pn' ,n' J Pn.n + Pi b^ n {p,p\) (33) 

where the scalar functions b^ n depend continuously on p and $\. 

Let us prove the lemma by contradiction. Assume that for all C > 0, there 
exists (p c ,Pi) G V x [— u, u\ satisfying Q2(p C ,Pi) = 0, such that 

Vne{l,...,d}, p° n <l-C\^\. 

Take C tending towards +00. Since p c and /3f remain in a compact set, we 
can assume, up to some extraction process, that p c and /3p converge towards 
p* and PI in V and [-u,u\. Since |/3p| < (1 - p% n )/C < 1/C, we have P\ = 0. 
Since 



\^,n\ 2 pZn = fE | 2 P«',n') <°«.« + 6 M,»(p°',/9l') (34) 

we have by continuity for C n- +00 



l c M,«l Pn.n — I l C ^'™' 
\ n' 



2 * I * 
Pn'n' I rn.n 
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for all n and p. Thus there exists n* £ {l,...,d} such that p* = \n*) (n*\ (see 
the proof of Theorem 12. Since p* n *_ n , = 1, for C large enough, p^» in » > 1/2 
and thus 

El^'l 2 ^"') =K,n.| 2 -/jf ^"l^'^^ 05) 
„, / Pn*,n* 

Taking n =^ n* , by Assumption [5J there exists p such that |c Mi „| 2 7^ |c Mi „*| 2 . 
Replacing (|35|) in (|34|) yields 

(|C,,„| 2 - |C^-| 2 + Pin = P? b,AP°\P?) 

Thus, there exists Co > 0, such that for n 7^ n* and C large enough 

p£„<c |& c |. 

But Pn* 7 n* — 1 — Z) n /„* P«,n > 1 ~ C (d ~ |. This is in contradiction with 
Pn«,n* - 1 ~ C|/3p | as soon as C > C (d - 1). 
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